Picture for Lei Xie

Lei Xie

Nanjing University

Integrating Fine-Grained Audio-Visual Evidence for Robust Multimodal Emotion Reasoning

Add code
Jan 26, 2026
Viaarxiv icon

LLM-ForcedAligner: A Non-Autoregressive and Accurate LLM-Based Forced Aligner for Multilingual and Long-Form Speech

Add code
Jan 26, 2026
Viaarxiv icon

dLLM-ASR: A Faster Diffusion LLM-based Framework for Speech Recognition

Add code
Jan 25, 2026
Viaarxiv icon

S$^2$Voice: Style-Aware Autoregressive Modeling with Enhanced Conditioning for Singing Style Conversion

Add code
Jan 20, 2026
Viaarxiv icon

WenetSpeech-Wu: Datasets, Benchmarks, and Models for a Unified Chinese Wu Dialect Speech Processing Ecosystem

Add code
Jan 16, 2026
Viaarxiv icon

VoiceSculptor: Your Voice, Designed By You

Add code
Jan 15, 2026
Viaarxiv icon

The ICASSP 2026 Automatic Song Aesthetics Evaluation Challenge

Add code
Jan 12, 2026
Viaarxiv icon

Invisible Walls: Privacy-Preserving ISAC Empowered by Reconfigurable Intelligent Surfaces

Add code
Jan 08, 2026
Viaarxiv icon

OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing

Add code
Dec 16, 2025
Viaarxiv icon

Adaptive Matched Filtering for Sensing With Communication Signals in Cluttered Environments

Add code
Dec 09, 2025
Viaarxiv icon